Writing on Dirty Paper with Resizing and its Application to 
Quasi-Static Fading Broadcast Channels 



Wenyi Zhang 
Communication Science Institute 
University of Southern California 
Los Angeles, CA 90089 
Email: wenyizha@usc.edu 



Shivaprasad Kotagiri and J. Nicholas Laneman 
Department of Electrical Engineering 
University of Notre Dame 
Notre Dame, IN 46556 
Email: {skotagir, jlaneman}@nd.edu 



o 
o 

Oh 
< 

O ' 

(N : 



CO 



> 
00 
(N 

o 
o 



X 



Abstract — This paper studies a variant of the classical problem 
of "writing on dirty paper" in which the sum of the input and 
the interference, or dirt, is multiplied by a random variable that 
models resizing, known to the decoder but not to the encoder. 
The achievable rate of Costa's dirty paper coding (DPC) scheme 
is calculated and compared to the case of the decoder's also 
knowing the dirt. In the ergodic case, the corresponding rate 
loss vanishes asymptotically in the limits of both high and low 
signal-to-noise ratio (SNR), and is small at all finite SNR for 
typical distributions like Rayleigh, Rician, and Nakagami. In the 
quasi-static case, the DPC scheme is lossless at all SNR in terms of 
outage probability. Quasi-static fading broadcast channels (BC) 
without transmit channel state information (CSI) are investigated 
as an application of the robustness properties. It is shown that 
the DPC scheme leads to an outage achievable rate region that 
strictly dominates that of time division. 

I. Introduction 

In this paper we consider the following variant of Costa's 
"writing on dirty paper" (WDP) problem [1], 



Y t = y/Ai ■ (Xi + Si) + Z h i = l,. 



,,n, 



(1) 



where the channel noise samples {^}™ =1 are indepen- 
dent, identically distributed (i.i.d.) circular-symmetric complex 
Gaussian with mean zero and variance iV, i.e., Zi ~ CN(0, N) 
for i = 1, . . . , n, and the interference signal samples {<Si}" =1 
are i.i.d. CN(0, Q). The average power of the inputs {Xi}2 =1 
is constrained by 



1 



n 

E 



X,\ 2 <P. 



The non-negative random variables {^}™ =1 , with marginal 
probability density function (PDF) p(a) for a S [0, oo), model 
resizing coefficients or fading coefficients in wireless applica- 
tions. Furthermore, {Ai}f =1 and {Si}f =1 are independent. 
We consider the following two scenarios: 

> Ergodic case: {Ai}™ =1 are n i.i.d. random variables. 

> Quasi-static case: {Ai}f =1 remain constant over the 
entire coding block. 

In both cases we assume that are known to the 

decoder, but not to the encoder. In the context of resizing and 
fading channels this is often a reasonable first-order approxi- 
mation because the decoder may estimate the coefficients with 
satisfactory accuracy, but the transmitter may not for lack of 



an adequate feedback link. As in the original WDP problem, 
{Si}i=i are known to the encoder but not to the decoder. In the 
sequel, for simplicity and without loss of generality, we drop 
the time index i, normalize A such that it has unit expectation 
E[A] = 1, and define the average signal-to-noise ratio (SNR) 
as p := P/N. 

When A = 1 with probability one, the problem reduces to 
the original WDP problem [1]. By introducing the auxiliary 
random variable 



U :=X 



1 



P 



-S. 



(2) 



where the channel input X ~ CN(0, P) and is independent of 
the interference S, the capacity of the channel (Q]) is given by 
[1] 



C = I(U;Y)-I(U;S)=\og(l+p). 



(3) 



The optimality of C is apparent because it is lossless compared 
to the case in which the decoder knows S. In other words, the 
dirty paper coding (DPC) scheme can remove any effect of 
the additive interference. 

The situation changes for general resizing or fading scenar- 
ios in which the coefficient A affects the realized or instanta- 
neous SNR which is unknown to the encoder. Therefore the 
encoder is incapable of dynamically adjusting its precoding to 
obtain optimality. 

In the remainder of this paper, however, we demonstrate that 
the DPC scheme leads to fairly robust performance even for 
resizing or fading channels. We compare the achievable rate of 
the DPC scheme in our setup and the channel capacity when 
the decoder also knows the interference. In the ergodic case 
(Section [TTJ, the DPC scheme turns out to be asymptotically 
lossless in the limits of both high and low SNR. Specifically, 
at high SNR the gap between the two rates vanishes, and at 
low SNR the ratio between the two rates approaches one. In 
the quasi-static case (Section ITITb. the DPC scheme is lossless 
at all SNR in terms of outage probability. 

As one application of the revealed robustness properties, we 
investigate quasi-static fading broadcast channels (BC) without 
transmit channel state information (CSI) (Section lIVl l. It is 
shown that the DPC scheme leads to an outage achievable rate 
region equivalent to the capacity region of a corresponding 



BC without fading. This rate region is, to the best of the 
authors' knowledge, the first that strictly dominates that of 
time division. 

II. Ergodic Case 

A. Some Preliminaries 

From a high-level perspective, the ergodic case falls into the 
category of channel coding with two-sided state information 
[2]. In [2, Theorem 1] the authors consider a discrete memo- 
ryless channel p(y\x, si, S2) with state information (Sij, S^.i) 
i.i.d. ~ p(si, S2), Si non-causally known to the encoder, and 
S2 known to the decoderQ They show that the channel has 
capacity 



C= max [I(U ; S2, Y) — I(U; Si)] . 

p(u,x\si) 



(4) 



We note that the proof of © in [2] requires that the 
alphabets |X|, |§i|, | S2 1 , \M\ be finite. When this condition 
is violated, for example in the WDP problem, provides an 
achievable rate, but it remains unclear if it also provides the 
converse. In this paper, we focus on utilizing (0]i to establish 
achievable rate for the particular class of U generated by the 
DPC scheme. 

As a side note, (0]l may also be interpreted by treating 52 as 
an additional channel output, and adopting an argument similar 
to that in [3] for channels with causal state information. 

We adopt the DPC scheme of [1]. The channel input X 
is i.i.d. CN(0, P), and the auxiliary random variable U = 
X + aS, where a is a constant to be designed. Noting that 
the coefficient A is independent of U, we have an achievable 
rate 

R = I{U; A, Y) - I(U; S) = I(U; Y\A) - I(U; S). (5) 

Following algebraic manipulations similar to those in [1], we 
obtain 

P[A{P + Q) + N] 



R = E 



log- 



(6) 



(1 - a) 2 APQ + {P + a 2 Q)N 

where the expectation is with respect to the distribution p(a) 
of A. 

For every given configuration of (P, Q, N) and p(a), maxi- 
mizing © gives the optimal choice of a and the corresponding 
maximized R. In the following, however, we focus on the 
particular choice of a = p/(p + 1). That is, even though 
the channel is time-varying, we precode for the average SNR 
rather than the instantaneous SNR for each channel use. 

For simplicity we define the interference-to-power ratio 
(IPR) /3 := Q/P. As we let a = p/(p + 1), © becomes 

(p+l) 2 [(l + /3)Ap+l] 



R = E 



log- 



(7) 



{/3+l)p 2 + {(3A + 2)p+l 

When the decoder also knows the interference S, it can 
simply subtract off >/AS, and the capacity is 



C = E[log(l + Ap)], 



(8) 



which provides a performance upper bound for comparison in 
the following. 

B. Asymptotics 

(1) High SNR: As p — > 00, by expanding (0 with respect 
to 1/p, we obtain 

R = logp + EpogA] + P+ ^ A] ■ (VP) + o(l/ p). (9) 
Comparing (O with the high-SNR expansion of (JHJ 

C = \ogp + E[logA] + E[i] ■ (1/p) + o(l/ p), (10) 

we observe that their difference vanishes as p^oo| 

(2) Low SNR: As p — * 0, by expanding © with respect to 
p, we obtain 

R = p- [(2fi+l)E[A 2 ]-2l3\- p 2 +o{p 2 ). (11) 
Comparing (fTTT i with the low-SNR expansion of dHJ 

C = p-E{A 2 }p 2 +o(p 2 ), (12) 
we observe that their ratio approaches one as p — > 0. 

C. Finite SNR Behavior 

For finite SNR, we denote the gap between C and R by A, 
which is 



A 



C -R 



E 



log 



(Ap + 1)[({3 + l)p 2 + (f3A + 2)p+l] 



(p + l) 2 [((3 + l)Ap + l] 
An upper bound to A is easily obtained by letting j3 



00: 



' Since we always consider block decoding, it makes no difference whether 
we assume causal or non-causal state information at the decoder. 



A < A := E [log(p + A) + log(p + l/A)] - 2 \og{p + 1). (13) 

From (1131 , we can evaluate A either analytically or numer- 
ically using Monte Carlo, thereby obtaining an upper bound 
to the possible rate loss of the DPC scheme. For Rayleigh 
fading channels, it is found that A is maximized around 
p = dB, and the maximum value is about 0.384. That 
is, the performance loss due to the DPC scheme is less than 
0.4 nats per channel use, at all SNR. Similarly, we obtain 
numerical results for Rician and Nakagami fading channels. 
For Rician fading channels, the maximum A across SNR 
never exceeds that when the line-of-sight (LOS) component 
is absent, i.e., when the channel is Rayleigh; and is further 
reduced when the LOS component increases, i.e., when the 
channel becomes more Gaussian. For Nakagami channels, the 
maximum A monotonically decreases as the fading figure 
m > 1/2 increases [4], and never exceeds one nat per channel 
use. 

The expansions of (9) and (10} implicitly assume that E[l/A] < 00. When 
this is not the case, these expansions involve fractional-order terms, but the 
conclusion that their difference vanishes as p — * 00 can still be shown to 
hold. 



III. Quasi-Static Case 

In the quasi-static fading case, the coefficient A is one 
realization drawn according to p(a). Using the DPC scheme 
with a independent of A, from [ 1 ] we have that the achievable 
rate is the random variable 

P[A(P + Q) + N] 



J(a, A) := log ■ 



(14) 



(1 - afAPQ + (P + a 2 Q)N' 

induced by A. For every R > 0, we can calculate the outage 
probability Pr [A : J(a,A) < R], i.e., the probability that the 
realization of A makes the achievable rate J(a, A) insufficient 
to support the target rate R. Furthermore, let us adjust a to 
minimize the outage probability for every given R. To this end 
we notice that 



Pr [A : J (a, A) < R] 

(e R - 1)PN 



Pr 



A : A < 



P(P + Q-(l-a) 2 e R Q) 



(15) 



Hence by straightforward manipulations, we find that the 
minimizer of Pr [A : J (a, A) < R] is a* = 1 — e~ R , and that 
the corresponding minimum outage probability is 

P R 

minPr [A : J {a, A) < R] = Pr 



A : A < 



1 



P 



(16) 



where p := P/N as in the previous sections. 
Discussion: 

(1) In view of ( fTSI l, we observe that this minimum outage 
probability coincides with the minimum outage probability 
when the decoder also knows S. That is, in the quasi-static 
case, the DPC scheme is optimal at all SNR, regardless of 
the specific distribution for A. Furthermore, it is important 
to note that the optimal choice of a depends upon the target 
rate R. In fact, we may introduce a virtual SNR p* satisfying 
R = log(l + p*), and rewrite a* = 1 - e~ R = p*/(l + p*). 
That is, for a given target rate, the optimal strategy is to simply 
treat the channel as if it is realized to just be able to support 
this rate. 

Such behavior can be explained by the following coinci- 
dence argument. The conditional achievable rate J(a, A) as 
given by ([141 is a function of two variables, a and A. It can be 
verified that for every a, J(a, A) is monotonically increasing 
with A. On the other hand, the DPC scheme shows that, for 
every A, the optimal choice of a maximizing J(a, A) is given 
by oP PC (A) = Ap/(Ap + 1). Therefore, for a given target 
rate R, if we solve J(a DPC (A), A) = R for its solution A* 
and choose a* = a BPC (A*) in the DPC scheme, we can 
guarantee that for every fading realization A < A*, the target 
rate R is achievable. 

(2) To illustrate how the DPC scheme achieves the same 
outage probability as if there were no interference, we can 
plot the cumulative distribution function (CDF) of J(a,A). 
Assuming that the fading is Rayleigh such that A follows the 
exponential distribution with unit mean, Figure Q] displays the 
CDF for P/N = 10 and Q = P. Plotted are two solid curves 
corresponding to a = 0.3 with R w 0.36 nats and a = 0.7 
with R « 1.20 nats, respectively. It is clear that the CDF 



of the DPC scheme depends upon the particular choice of a. 
The reference CDF, with Q — 0, is plotted as the dashed- 
dot curve. It is clearly illustrated that, for a given target rate 
R, if we choose the corresponding a* = 1 — e~ R , the CDF 
of the DPC scheme tangentially intersects the reference CDF 
exactly at J(a, A) = R, hence resulting in the identical outage 
probability. 




Fig. 1. The CDF of J(a,A) for Rayleigh fading channels. 

(3) Finally we note that the conditional achievable rate 
J(a, A) is negative for a certain range of A, as calculated from 
dT4-b . This implies that for some values of A, the DPC scheme 
with the choice of a cannot lead to nonnegative achievable 
rate. From a practical perspective, whenever J(a, A) < 0, it 
loses no generality to replace it with J(a, A) = 0. 

IV. Application to Quasi-Static Fading BC 

If we treat the interference S in the channel model (Q]i as a 
source image, the robustness results obtained in the preceding 
sections can be immediately applied in digital watermarking 
with resizing or transmission over fading channels. Alterna- 
tively, we can treat both the signal X and the interference S 
as coded messages, then such a two-layer coding scheme can 
be useful in the fading BC. In this section, we illustrate that 
the robustness property of the DPC scheme can be useful for 
quasi-static fading BC without transmit CSI. 

A. Background Overview 

The Gaussian BC has been investigated since the seminal 
paper of Cover in 1972 [5]. It is shown that for scalar Gaussian 
BC, superposition coding achieves a rate region that dominates 
that of time division, and actually yields the capacity region 
[6], If the channel inputs are vector- valued, the vector Gaus- 
sian BC is generally non-degraded, and superposition coding 
ceases to yield optimal performance. On the other hand, the 
DPC scheme has been shown to maximize the throughput of 
the Gaussian vector BC [7]. This observation has stimulated 
a series of subsequent work on multi-antenna BC [8], [9], 
[10]. Recently the achievable rate region obtained by the DPC 
scheme in [7] has been shown to be the capacity region [11]. 



A central prerequisite of the aforementioned results is that 
the encoder has full access to CSI. It is shown in [12] that, if 
the transmit CSI is noisy, then the high SNR growth rate of the 
channel throughput is significantly reduced compared to that 
for perfect transmit CSI. For fading BC with ergodic fading, 
when the encoder possesses no CSI, neither the capacity 
region nor the maximum channel throughput is known. Some 
achievable rate regions are obtained in [13], [14], and it is 
conjectured therein that Gaussian inputs achieve the capacity 
region. 

The requirement of transmit CSI appears even more crucial 
for quasi-static channels. Consider a scalar BC. When trans- 
mit CSI is available, the encoder then knows exactly which 
decoder is degraded, possibly changing from one coding block 
to another. Superposition coding therefore can be utilized, such 
that in each coding block the "less noisy" decoder performs 
successive decoding, and the resulting achievable rate region 
is the capacity region conditioned upon the fading realization. 
Consequently, the outage capacity region can be identified 
[15]. 

However, if transmit CSI is absent, little progress has been 
reported. The main difficulty appears to be that the lack 
of transmit CSI prevents the encoder from making efficient 
utilization of superposition coding, because there is generally 
no intelligent way to identify which decoder is less noisy, 
even for the scalar case. Generally, the quasi-static fading 
BC without transmit CSI is non-degraded, and belongs to the 
class of mixed channels as defined in [16]. For such channels, 
no computable single-letter characterization of their e-capacity 
regions, i.e., outage capacity regions, has been identified [17]. 

B. Two-User Case 

Consider the two-user quasi-static fading BC model in 
which 

Y i = y/AlX + Z i , i= 1,2. (17) 

The additive noise Z{ ~ CN(0, 1), and the input X has 
an average power constraint p. The fading coefficient Ai is 
known at the corresponding decoder i, but not at the encoder. 
We assume that Ai has PDF pi(a) for a 6 [0, oo) and that 
(A±, A2) are independent and remain constant over the entire 
coding block. In the sequel, the inverse cumulative distribution 
function (ICDF) of Pi(-), denoted by G;( ), will be of use. 
For every t € [0, 1], Gi(t) is defined as the infimum such that 
Pr[A 4 < Gi(t)]=t 

For any given target outage probability vector e := 
(£1,62) € [0, l] 2 , we can define the e-outage capacity region 
e out (p,e) as the union of all rate vectors R = (R 1 ,R 2 ) that 
can be achieved with the outage probability for decoder i no 
larger than ej, i = 1,2 [15]. To simplify the notation and 
discussion, we assume Gi(ei) > G 2 (e2)- 

A conceptually simple scheme is time-division (TD) coding, 
with time-sharing factor (/z, 1 — p.) and power allocation factor 
(771, 772)- This yields the boundary of the e-outage achievable 



rate region 

& d *> a (p,e) := (J (< Pa (^M,m,e),i?2 dpa (p,M,^,6)), 

jji>0,i=l,2 

(18) 

where 

R\ pa (p,/x,r?i,e) := ;ulog[l + r^G^ei) • p] 
i?2 dpa (p,Ai, 772,e) := (1 - fi) log[l + 77 2 G 2 (e 2 ) • p] 
OTi + (1 - = 1- 

It turns out that we can do better than H tdpa (p, e), if we ap- 
propriately utilize the robustness property of DPC as revealed 
in Section [Till We split the input into X = X\ + X2, with 
X x ~ eW(0,7p), X 2 ~ CX(0,(1 - -y)p), and independent. 
The message for decoder 2 is encoded as X 2 simply chosen 
from a Gaussian codebook and the message for decoder 1 
is encoded as X\ treating X2 as interference using the DPC 
scheme. 

From Section [HI] for target rate vector R = (i?i,i? 2 ), 
decoder 1 experiences outage if 

exp(i?i) — 1 , _ 

Ai < — ' or log (1 + jAip) < Ri. 

IP 

On the other hand, decoder 2 treats X\ as additive noise, 
therefore it experiences outage if 

( A2P+I \ 
log -. — 7 < R2. 

\A 2 ~fp+ 1/ 

So the boundary of the e-outage achievable rate region for the 
DPC scheme is 

76[0,1] 

where 

R d i PC (p,l,£) ■= log[l + 7 C? 1 (ei)p] 

R * {p ^ ] = log l 1+ 1+7^2) p )- 

We observe that 3l td P a ( j o,e) and 3? dpc (p,e) can also be 
interpreted as achievable rate regions of a BC without fading 
in which 

% = y/Gi{€i)X + Z t , i= 1, 2. (20) 

Specifically, 3l tdpa (p, e) is achieved by time-division schemes 
with power allocation, and since Gi(ei) > G 2 (e 2 ), 3l dpc (/3, e) 
is achieved by superposition coding and therefore is the ca- 
pacity region. Consequently, we have that whenever G\ (ei) > 
G 2 (e 2 ), the e-outage achievable rate region of the DPC 
scheme, ft dpc (p,e), contains that of time division, 3? tdpa (/9,e) 
[18]. 



C. General K-User Case 

We can extend the two-user BC result to the general A'-user 
case. The channel model is 

(21) 



Zk, for k = 1, . . . , K, 

where Z k ~ CN(0, 1), and X has an average power constraint 
p. Similarly, each A k , following PDF pk{a) for a £ [0, oo) 
and remaining constant over the entire coding block, is known 
at decoder k but not at the encoder. The A k 's are mutually 
independent. We have the following result. 

Proposition 1: For the A-user quasi-static scalar fading BC 
without transmit CSI, OTT) . the DPC scheme achieves the 
boundary of the e-outage achievable rate region 

* dpc (p, e) := |J (i? dpc (p, 7, ei), • . . , R d K pc (p, 7, e K )) , (22) 



where 



log 1 



lkG k {t k ) ■ p 



K 

and y^7fc = 1, for j k > 0, k = 1, . . . , K. 

k=l 

In view of ft dpc (p,e), we again observe that if Gi(ei) > 
G 2 (e 2 ) > ... > G K (e K ), then 0i dpc (p,e) is exactly the 
capacity region of the BC without fading 



Y k = y/G k (e k )X + Z k , forfc = l, 



,A. 



(23) 



In light of this observation, we conjecture that the DPC scheme 
with appropriate ordering of the encoders is optimal in terms 
of the outage achievable rate region. 

Conjecture 1: For the A-user quasi-static scalar fading BC 
without transmit CSI, (f2Tb . if the A decoders are ordered such 
that Gi(ci) > G 2 (e 2 ) > ... > G K {e K ), then 3v dpc (p,e) is 
the boundary of the e-outage capacity region C out (p,e). 

Proof of Proposition Q} The proof essentially follows the 
same idea of the two-user case. Fix the power allocation 7. 
For decoder k, rewrite the channel as 

Y k = ^A~ k X k + Xi + (V^X^ X j + Z k), (24) 

i> k j<k 

where X k denotes the encoded signal for decoder k. Let X k be 
mutually independent and X k ~ C3\f(0, jip) for k — 1, . . . , A. 
The message for decoder k is encoded as X k , by the DPC 
scheme treating J2i> k x i as transmit interference, and treating 

(y/A~k Y,j<k X 3 + Z k) as noise- 
Following the same coincidence argument as in Section HTT1 
we can extend the robustness property of the DPC scheme 
to the channel d24l >. That is, for target rate R k , if we choose 
in the DPC scheme, the outage probability 



= 1 - e 



-Rk 



is Pr 



A k : A k < 



as if the transmit 



lfc ■ ^ k ~ 7fcP -(e«fe-l)(E 3<fc 7i)pJ 

interference Yli>k x i i s known at the decoder k. Equivalently, 
for a given target outage probability e k for decoder k, the 
maximum achievable rate R k should satisfy 

e Rk - 1 

G k (c k ), 



which immediately gives rise to one point of the boundary 
3? dpc (e) for the fixed power allocation 7. The entire boundary 
3? dpc (e) then follows, as we exhaust all the possible choices 
of 7. 
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